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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/313 , 434A 



DATE: 04/17/2001 
TIME: 14:14:02 



RECEIVED 

APR 3 0 2001 

]E CH CENTER 1600/2900 




1645 



4 <110> 
6 <120> 
9 <130> 

11 <140> 

12 <141> 

14 <150> 

15 <151> PRIOR 

17 <150> PRIOR 

18 <151> PRIOR 

20 <150> PRIOR 

21 <151> PRIOR 

23 <150> PRIOR 

24 <151> PRIOR 

26 <150> PRIOR 

27 <151> PRIOR 
29 <160> NUMBER 
31 <170> 

<210> SEQ 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 



Input Set : A:\00786-432001.TXT 

Output Set: N:\CRF3\04172001\I313434A.raw 

APPLICANT: Podolsky, Daniel K . 

TITLE OF INVENTION: INTESTINAL TREFOIL PROTEINS 
FILE REFERENCE: 00786-432001 

CURRENT APPLICATION NUMBER: US 09/313, 434A 
CURRENT FILING DATE: 1999-05-17 
PRIOR APPLICATION NUMBER: US 08/631,469 
FILING DATE: 1996-04-12 
APPLICATION NUMBER: US 08/191,352 



ENTERED 



FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
APPLICATION 
FILING DATE 
OF SEQ ID 



1994-02-02 
NUMBER: US 08/037,741 

1993-03-25 
NUMBER: US 07/837,192 

1992-02-13 
NUMBER: US 07/655,965 
: 1991-02-14 
NOS: 21 



SOFTWARE: FastSEQ for Windows Version 4.0 

ID NO: 1 
LENGTH: 431 
TYPE: DNA 

Rattus 



33 
34 
35 
36 
38 
39 
40 
42 
43 
44 
45 

47 ctg gtc ctg gtt get 

48 Leu Val Leu Val Ala 



ORGANISM: 
FEATURE : 
NAME/KEY 
LOCATION 



norvegicus 



<400> SEQUENCE 



CDS 
(18) 
1 



(260) 



gaagtttgcg tgetgee 



49 
51 
52 
53 
55 
56 
57 
59 
60 
61 
63 
64 
65 
67 
68 
69 
71 



eta 
Leu 

aac 
Asn 

ttt 
Phe 
60 
gag 
Glu 



tct 
Ser 



cca 
Pro 
30 
ccc 
Pro 



tac 
Tyr 
45 
gac tec 
Asp Ser 



aca 
Thr 



gaa 
Glu 



15 
age 
Ser 

act 
Thr 

age 
Ser 

tgt 
Cys 



caa 
Gin 

gtc 
Val 

ate 
He 

aca 
Thr 
80 



atg 
Met 
1 

ggg 

Gly 

tgt 
Cys 

aca 
Thr 



gag 
Glu 

tec 
Ser 

atg 
Met 



acc 
Thr 

tec 
Ser 

gcg 
Ala 
35 
gag 
Glu 



aga 
Arg 



gee ttc tgg ata acc ctg ctg 
Ala Phe Trp He Thr Leu Leu 
5 10 
tgc aaa gee cag gaa ttt gtt ggc 
Cys Lys Ala Gin Glu Phe Val Gly 

20 25 
cca aca aat gtc agg gtg gac tgt 
Pro Thr Asn Val Arg Val Asp Cys 

40 

tea gag cag tgt aac aac cgt ggt tgc tgt 
Ser Glu Gin Cys Asn Asn Arg Gly Cys Cys 
50 55 
cca aat gtg ccc tgg tgc ttc aaa cct ctg caa 
Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin 



65 



70 



75 



ttt tgaagctgtc caggctccag gaagggagct 
Phe 



ccacaccctg gaetcttget gatggtagtg geccagggta acactcaccc ctgatctget 
ccctcgcgcc ggecaatata ggagctggga gtccagaaga ataaagacct tacagtcagc 
acaaggctgt tetaattgeg g 
<210> SEQ ID NO: 2 



50 



98 



146 



194 



242 



290 



350 
410 
431 
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72 <211> LENGTH: 81 

73 <212> TYPE: PRT 

74 <213> ORGANISM: 

76 <400> SEQUENCE: 

77 Met Glu Thr Arg 
1 

Gly Ser Ser Cys 

20 

Cys Met Ala Pro 
35 



norvegicus 



78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
90 
91 
92 
93 
95 



Rattus 
2 

Ala Phe Trp lie Thr Leu 

5 10 
Lys Ala Gin Glu Phe Val 



Leu Leu Val Leu 



Val 
15 
Ser 



Ala 



Gin 



Phe Val Gly Leu Ser Pro 
25 30 
Thr Asn Val Arg Val Asp Cys Asn Tyr Pro Thr Val 

40 45 
Thr Ser Glu Gin Cys Asn Asn Arg Gly Cys Cys Phe Asp Ser Ser lie 

50 55 60 

Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin Glu Thr Glu Cys Thr 
65 70 75 80 

Phe 



<210> SEQ ID NO: 3 
<211> LENGTH: 403 
<212> TYPE: DNA 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 

96 <221> NAME/KEY: CDS 

97 <222> LOCATION: (2)... (223) 
99 <400> SEQUENCE: 3 

100 



101 
102 
104 
105 
106 
108 
109 
110 
112 



g v atg ctg ggg ctg gtc ctg gcc ttg ctg tec tec age tct get gag 
Met Leu Gly Leu Val Leu Ala Leu Leu Ser Ser Ser Ser Ala Glu 



gag 
Glu 



tac gtg ggc ctg tct gca aac 
Tyr Val Gly Leu Ser Ala Asn 

20 

gtg gac tgc ggc tac ccc cat 
Val Asp Cys Gly Tyr Pro His 
35 

ggc tgc tgc ttt gac tec agg 



cag 
Gin 

gtc 
Val 
40 
ate 



tgt 
Cys 
25 
acc 
Thr 



10 
gcc gtg 
Ala Val 



ccc 
Pro 



aag 
Lys 



ccg 
Pro 

gag 
Glu 



cct gga gtg cct 



gcc 
Ala 

tgc 
Cys 
45 
tgg 



15 

aag gac agg 
Lys Asp Arg 
30 

aac aac egg 
Asn Asn Arg 

tgt ttc aag 



113 Gly Cys Cys Phe Asp Ser Arg lie Pro Gly Val Pro Trp Cys Phe Lys 

114 50 55 60 

116 ccc ctg act agg aag aca gaa tgc acc ttc'tgaggcacct ccagctgccc 

117 Pro Leu. Thr Arg Lys Thr Glu Cys Thr Phe 

118 65 70 

120 ctgggatgca ggctgagcac ccttgcccgg ctgtgattgc tgccaggcac tgttcatctc 

121 agtttttctg tccctttgct cccggcaagc tttctgctga aagttcatat ctggagcctg 

122 atgtcttaac gaataaaggt cccatgctcc acccgaaaaa 

124 <210> SEQ ID NO: 4 

125 <211> LENGTH: 74 

126 <212> TYPE: PRT 

127 <213> ORGANISM: Homo sapiens 

129 <400> SEQUENCE: 4 

130 Met Leu Gly Leu Val Leu Ala Leu Leu Ser Ser Ser Ser Ala Glu Glu 

131 15 10 15 

132 Tyr Val Gly Leu Ser Ala Asn Gin Cys Ala Val Pro Ala Lys Asp Arg 



49 



97 



145 



193 



243 



303 
363 
403 . 
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133 20 



25 30 



134 Val Asp Cys Gly Tyr Pro 

135 35 



His Val Thr Pro Lys Glu Cys Asn Asn Arg 



136 Gly Cys Cys Phe Asp Ser 

137 50 



40 45 
Arg lie Pro Gly Val Pro Trp Cys Phe Lys 



138 Pro Leu Thr Arg Lys Thr 

139 65 70 



55 60 
Glu Cys Thr Phe 



141 <210> SEQ ID NO: 5 

142 <211> LENGTH: 10 

143 <212> TYPE: DNA 

144 <213> ORGANISM: Artificial Sequence 

146 <220> FEATURE : 

147 <223> OTHER INFORMATION: motif 

149 <400> SEQUENCE: 5 

150 gggcggccgc 10 

152 <210> SEQ ID NO: 6 

153 <211> LENGTH: 21 

154 <212> TYPE: DNA 

155 <213> ORGANISM: Artificial Sequence 

157 <220> FEATURE: 

158 <223> OTHER INFORMATION: oligonucleotide for PCR 

160 <400> SEQUENCE: 6 

161 gtacattctg tctcttgcag a 21 

163 <210> SEQ ID NO: 7 

164 <211> LENGTH: 24 

165 <212> TYPE: DNA 

166 <213> ORGANISM: Artificial Sequence 

168 <220> FEATURE: 

169 <223> OTHER INFORMATION: oligonucleotide for PCR 

171 <400> SEQUENCE: 7 

172 taaccctgct gctgctggtc ctgg 24 

174 <210> SEQ ID NO: 8 

175 <211> LENGTH: 21 

176 <212> TYPE: DNA 

177 <213> ORGANISM: Artificial Sequence 

179 <220> FEATURE: 

180 <223> OTHER INFORMATION: oligonucleotide for PCR 

182 <400> SEQUENCE: 8 

183 gtttgcgtgc tgccatggag a 21 

185 <210> SEQ ID NO: 9 

186 <211> LENGTH: 21 

187 <212> TYPE: DNA 

188 <213> ORGANISM: Artificial Sequence 

190 <220> FEATURE: 

191 <223> OTHER INFORMATION: oligonucleotide for PCR 

193 <400> SEQUENCE: 9 

194 ccgcaattag aacagccttg t 21 

196 <210> SEQ ID NO: 10 

197 <211> LENGTH: 25 
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198 
199 
201 
202 
204 
205 
207 
208 
209 
210 
212 
215 
216 
217 
218 
219 
220 
221 
222 
223 
225 
226 
227 
228 
230 
232 
233 
234 
235 
236 
237 
238 
239 
240 
242 
243 
244 
245 
247 
248 
249 
251 
252 
253 
254 
256 
257 
258 
260 



Artificial Sequence 

oligonucleotide for PCR 



<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE: 
<223> OTHER INFORMATION 
<400> SEQUENCE: 10 
gcagtgtaac aaccgtggtt gctgc 
<210> SEQ ID NO: 11 
<211> LENGTH: 60 
<212> TYPE: PRT 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 
<400> SEQUENCE: 11 

Glu Ala Gin Thr Glu Thr Cys Thr Val Ala Pro Arg Glu 

15 10 
Cys Gly Phe Pro Gly Val Thr Pro Ser Gin Cys Ala Asn 

20 25 
Cys Phe Asp Asp Thr Val Arg Gly Val Pro Trp Cys Phe 

35 40 45 

Thr lie Asp Val Pro Pro Glu Glu Glu Cys Glu Phe 

50 55 60 

<210> SEQ ID NO: 12 
<211> LENGTH: 62 
<212> TYPE: PRT 
<213> ORGANISM: Homo sapiens 
<220> FEATURE: 
<400> SEQUENCE: 12 
Glu Lys Pro Ala Ala Cys Arg Cys 

1 5 
Val Asn Cys Gly Phe Pro Gly lie 

20 



25 



Arg Gin Asn 
15 

Lys Gly Cys 
30 

Tyr Pro Asn 



Ser Arg Gin Asp Pro 
10 

Thr Ser Asp Gin Cys 
25 



Gly Cys Cys Phe Asp Ser Gin Val Pro Gly Val Pro Trp 



35 



Pro 



Leu Pro Ala Gin Glu 
50 

<210> SEQ ID NO: 13 
<211> LENGTH: 318 
<212> TYPE: DNA 
ORGANISM 
FEATURE : 
NAME/KEY 
LOCATION 



Ser 
55 



40 

Glu Glu 



Cys Val 



45 
Met Glu 
60 



Lys Asn Arg 
15 

Phe Thr Ser 
30 

Cys Phe Lys 
Val 



<213> 
<220> 
<221> 
<222> 



Homo sapiens 



<400> SEQUENCE 



gag 
Glu 

1 
acg 



aaa 
Lys 



ccc 
Pro 



tec 
Ser 



aac tgc ggc 



CDS 

(1) 
13 

ccc 

Pro 

5 

ttc 



. . (318) 

tgc cag 
Cys Gin 



tgc tec agg ctg age ccc 
Cys Ser Arg Leu Ser Pro 
10 

ate acc agt gac cag tgt 



cct gga 

Thr Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys 

20 25 
gga tgc tgt ttc gac tec agt gtc act ggg gtc ccc tgg 



cat aac agg 
His Asn Arg 
15 

ttt gac aat 
Phe Asp Asn 
30 

tgt ttc cac 



48 



96 



144 
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ZOl 


Gly cys cys fne 


ASp 




ber 


vai 


rp I-. 

i nr 


Giy 


v ai 


fro 


rn -y> y^. 

irp 


cys 


fne 


HIS 


•"i C "i 

ZDZ 










a n 
4 U 










4 D 








264 


ccc etc cca aag 


caa 


gag 


teg 


gat 


cag 


tgc 


'4 — 

g l.c 


a r.g 


gag 


gx.c 


tea 


gac 




Pro Leu Pro Lys 


Gin 


GlU 


ber 


Asp 


Gin 


cys 


17o 1 

vai 


rieu 


GlU 


17=i 1 

vai 


O v 

ber 


ASp 


c. a 


50 
















£ft 

DU 










o c o 
ZOO 


aga aga aac tgt 


ggc 


tac 


ccg 


ggc 


ate 


age 


ccc 


gag 


gaa 


tgc 


gec 


4- _ 4- 

tct 




Arg Arg Asn Cys 


Gly 


Tyr 


Pro 


Gly 


Tin 

i le 




fro 


GlU 


Pin 
<jl U 


cys 


Ala 


ber 


z / U 


65 




70 










n c 
/ -> 










o U 


1 "7 *"> 


egg aag tgc tgc 


ttc 


tec 


aac 


ttc 


ate 


4-4-4- 
ttt 


gaa 


gtg 


ccc 


tgg 


tgc 


4- 4- _ 

ttc 


Z / J 


Arg Lys Cys Cys 


Phe 


Ser 


Asn 


Phe 


± re 


fne 




vai 


fro 


Trp 


cys 


fne 


z / 4 




85 










Q ft 

y U 










y d 




Z / D 


ttc ccg aac tct 


gtg 


gaa 


gac 


tgc 


cat 


tac 


/ 












111 


Phe Pro Asn Ser 


Val 


Glu 


Asp 


Cys 


T J J _ 

HIS 


Tyr 














Z / O 


100 










i nc 
1UD 
















Zo 1 


<210> SEQ ID NO 


: 14 
























1 Q O 
Z O Z 


<211> LENGTH: 105 
























O O *3 

z o J 


<212> TYPE: PRT 


























Z o4 


<213> ORGANISM: 


Homo sapiens 


















ZOO 


<400> SEQUENCE: 


14 
























Zo / 


Lys Pro Ser Pro 


Cys 


Gin 


Cys 


Ser 


Arg 


Leu 


C* -w* 

ber 


Fro 


HIS 


Asn 


Arg 


rn 1*l 

i nr 


TOO 
ZOO 


1 


5 










1U 










Id 




ion 

zb y 


Asn Cys Gly Phe 


Pro 


Gly 


He 


Thr 


C iF% V% 

ber 


Asp 


Gin 


Cys 


fne 


Asp 


Asn 


ciy 


2 y u 


20 










2 D 
















z y i 


Cys Cys Phe Asp 


Ser 


Ser 


Val 


Thr 


Gly 


vai 


fro 


rri -v* w 

l rp 


cys 


fne 


T_T -I 

HIS 


T^) V, yH-N. 

pro 


1 Q T 

z y z 


35 








40 










4 D 








zy j 


Leu Pro Lys Gin 


Glu 


Ser 


Asp 


Gin 


Cys 


val 


Met. 


GlU 


vai 


A Vt 

ber 


Asp 


Arg 


O Q /I 

z y 4 


50 






55 










£ ft 










one 

zy b 


Arg Asn Cys Gly 


Tyr 


Pro 


Gly 


He 


jAl ~h 

Ser 


f*S -A A 

Pro 


GlU 


GlU 


cys 


Ala 


Ser 


Arg 


z y d 


65 




70 










7 ^ 
/ J 










Q ft 


z y / 


Lys Cys Cys Phe 


Ser 


Asn 


Phe 


He 


Phe 


Glu 


Val 


fro 


rp -w~ y-- 

irp 


cys 


fne 


fne 


<-s n o 

2 y o 




85 










90 










y 5 




1 O O 

2y y 


Pro Asn Ser Val 


Glu 


Asp 


Cys 


His 


Tyr 
















o pi r\ 


100 










105 
















"3 C\ O 

JU2 


<210> SEQ ID NO 


15 
























q ft t 

-5 U O 


<211> LENGTH: 540 
























OA/I 

J U4 


<212> TYPE: DNA 


























o c 


<213> ORGANISM: 


Homo sapiens 


















307 


<220> FEATURE: 


























308 


<221> NAME/KEY: 


CDS 
























309 


<222> LOCATION: 


(41) 




[292) 




















311 


<400> SEQUENCE: 


15 














atg 










312 


atccctgact cggggtcgcc tttggagcag agaggaggca 


gec 


acc 


atg 


gag 


313 


















Met 


Ala 


Thr 


Met 


Glu 


314 


















1 








5 


316 


aac aag gtg ate 


tgc 


gee 


ctg 


gtc 


ctg 


gtg 


tec 


atg 


ctg 


gee 


etc 


ggc 


317 


Asn Lys Val lie 


Cys 


Ala 


Leu 


Val 


Leu 


Val 


Ser 


Met 


Leu 


Ala 


Leu 


Gly 


318 




10 










15 










20 




320 


ace ctg gec gag 


gec 


cag 


aca 


gag 


acg 


tgt 


aca 


gtg 


gec 


ccc 


cgt 


gaa 


321 


Thr Leu Ala Glu 


Ala 


Gin 


Thr 


Glu 


Thr 


Cys 


Thr 


Val 


Ala 


Pro 


Arg 


Glu 



192 



240 



288 



318 



55 



103 



151 
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